Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 721344 |
| Missing cells | 99519 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 121.1 MiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 14 |
|---|---|
| DateTime | 1 |
| Categorical | 7 |
origin has a high cardinality: 357 distinct values | High cardinality |
dest has a high cardinality: 357 distinct values | High cardinality |
crs_dep_time is highly correlated with wheels_off and 2 other fields | High correlation |
dep_delay is highly correlated with arr_delay | High correlation |
wheels_off is highly correlated with crs_dep_time and 2 other fields | High correlation |
wheels_on is highly correlated with crs_dep_time and 2 other fields | High correlation |
crs_arr_time is highly correlated with crs_dep_time and 2 other fields | High correlation |
arr_delay is highly correlated with dep_delay and 1 other fields | High correlation |
crs_elapsed_time is highly correlated with actual_elapsed_time and 2 other fields | High correlation |
actual_elapsed_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
air_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
distance is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
delayed is highly correlated with arr_delay | High correlation |
crs_dep_time is highly correlated with wheels_off and 2 other fields | High correlation |
dep_delay is highly correlated with arr_delay and 1 other fields | High correlation |
wheels_off is highly correlated with crs_dep_time and 2 other fields | High correlation |
wheels_on is highly correlated with crs_dep_time and 2 other fields | High correlation |
crs_arr_time is highly correlated with crs_dep_time and 2 other fields | High correlation |
arr_delay is highly correlated with dep_delay and 1 other fields | High correlation |
crs_elapsed_time is highly correlated with actual_elapsed_time and 2 other fields | High correlation |
actual_elapsed_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
air_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
distance is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
delayed is highly correlated with dep_delay and 1 other fields | High correlation |
crs_dep_time is highly correlated with wheels_off and 2 other fields | High correlation |
dep_delay is highly correlated with arr_delay | High correlation |
wheels_off is highly correlated with crs_dep_time and 2 other fields | High correlation |
wheels_on is highly correlated with crs_dep_time and 2 other fields | High correlation |
crs_arr_time is highly correlated with crs_dep_time and 2 other fields | High correlation |
arr_delay is highly correlated with dep_delay and 1 other fields | High correlation |
crs_elapsed_time is highly correlated with actual_elapsed_time and 2 other fields | High correlation |
actual_elapsed_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
air_time is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
distance is highly correlated with crs_elapsed_time and 2 other fields | High correlation |
delayed is highly correlated with arr_delay | High correlation |
dep_delay is highly correlated with arr_delay | High correlation |
distance is highly correlated with air_time and 2 other fields | High correlation |
wheels_on is highly correlated with crs_arr_time and 2 other fields | High correlation |
crs_arr_time is highly correlated with wheels_on and 2 other fields | High correlation |
df_index is highly correlated with week | High correlation |
arr_delay is highly correlated with dep_delay | High correlation |
air_time is highly correlated with distance and 2 other fields | High correlation |
week is highly correlated with df_index | High correlation |
wheels_off is highly correlated with wheels_on and 2 other fields | High correlation |
actual_elapsed_time is highly correlated with distance and 2 other fields | High correlation |
crs_dep_time is highly correlated with wheels_on and 2 other fields | High correlation |
crs_elapsed_time is highly correlated with distance and 2 other fields | High correlation |
dep_delay has 11772 (1.6%) missing values | Missing |
taxi_out has 11624 (1.6%) missing values | Missing |
wheels_off has 11624 (1.6%) missing values | Missing |
wheels_on has 11950 (1.7%) missing values | Missing |
taxi_in has 11950 (1.7%) missing values | Missing |
arr_delay has 13704 (1.9%) missing values | Missing |
actual_elapsed_time has 13447 (1.9%) missing values | Missing |
air_time has 13447 (1.9%) missing values | Missing |
df_index has unique values | Unique |
dep_delay has 35060 (4.9%) zeros | Zeros |
arr_delay has 13971 (1.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-22 23:38:55.967045 |
|---|---|
| Analysis finished | 2021-08-22 23:42:52.878756 |
| Duration | 3 minutes and 56.91 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 721344 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3604130.289 |
| Minimum | 18 |
|---|---|
| Maximum | 7213425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 360687.3 |
| Q1 | 1802147.5 |
| median | 3601846 |
| Q3 | 5406346.25 |
| 95-th percentile | 6848931.7 |
| Maximum | 7213425 |
| Range | 7213407 |
| Interquartile range (IQR) | 3604198.75 |
Descriptive statistics
| Standard deviation | 2081222.031 |
|---|---|
| Coefficient of variation (CV) | 0.5774547155 |
| Kurtosis | -1.200276611 |
| Mean | 3604130.289 |
| Median Absolute Deviation (MAD) | 1802139 |
| Skewness | 0.001414650665 |
| Sum | 2.599817759 × 1012 |
| Variance | 4.331485142 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3149822 | 1 | < 0.1% |
| 5826486 | 1 | < 0.1% |
| 7161090 | 1 | < 0.1% |
| 354319 | 1 | < 0.1% |
| 2272511 | 1 | < 0.1% |
| 626851 | 1 | < 0.1% |
| 4361467 | 1 | < 0.1% |
| 4359418 | 1 | < 0.1% |
| 3316985 | 1 | < 0.1% |
| 2625989 | 1 | < 0.1% |
| Other values (721334) | 721334 |
| Value | Count | Frequency (%) |
| 18 | 1 | |
| 32 | 1 | |
| 37 | 1 | |
| 44 | 1 | |
| 51 | 1 | |
| 68 | 1 | |
| 94 | 1 | |
| 114 | 1 | |
| 127 | 1 | |
| 142 | 1 |
| Value | Count | Frequency (%) |
| 7213425 | 1 | |
| 7213377 | 1 | |
| 7213358 | 1 | |
| 7213337 | 1 | |
| 7213323 | 1 | |
| 7213316 | 1 | |
| 7213308 | 1 | |
| 7213305 | 1 | |
| 7213299 | 1 | |
| 7213296 | 1 |
fl_date
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| Minimum | 2018-01-01 00:00:00 |
|---|---|
| Maximum | 2018-12-31 00:00:00 |
op_carrier
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| WN | |
|---|---|
| DL | |
| AA | |
| OO | |
| UA | |
| Other values (13) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1442688 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AS |
|---|---|
| 2nd row | EV |
| 3rd row | OO |
| 4th row | HA |
| 5th row | WN |
Common Values
| Value | Count | Frequency (%) |
| WN | 135087 | |
| DL | 94694 | |
| AA | 92050 | |
| OO | 77427 | |
| UA | 62102 | |
| YX | 31511 | 4.4% |
| B6 | 30512 | 4.2% |
| MQ | 29577 | 4.1% |
| OH | 27888 | 3.9% |
| AS | 24577 | 3.4% |
| Other values (8) | 115919 |
Length
| Value | Count | Frequency (%) |
| wn | 135087 | |
| dl | 94694 | |
| aa | 92050 | |
| oo | 77427 | |
| ua | 62102 | |
| yx | 31511 | 4.4% |
| b6 | 30512 | 4.2% |
| mq | 29577 | 4.1% |
| oh | 27888 | 3.9% |
| as | 24577 | 3.4% |
| Other values (8) | 115919 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 279226 | |
| O | 182742 | |
| N | 152866 | |
| W | 135087 | |
| D | 94694 | 6.6% |
| L | 94694 | 6.6% |
| U | 62102 | 4.3% |
| Y | 52948 | 3.7% |
| E | 44941 | 3.1% |
| V | 43778 | 3.0% |
| Other values (12) | 299610 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1366261 | |
| Decimal Number | 76427 | 5.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 279226 | |
| O | 182742 | |
| N | 152866 | |
| W | 135087 | |
| D | 94694 | 6.9% |
| L | 94694 | 6.9% |
| U | 62102 | 4.5% |
| Y | 52948 | 3.9% |
| E | 44941 | 3.3% |
| V | 43778 | 3.2% |
| Other values (9) | 223183 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 36412 | |
| 6 | 30512 | |
| 4 | 9503 | 12.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1366261 | |
| Common | 76427 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 279226 | |
| O | 182742 | |
| N | 152866 | |
| W | 135087 | |
| D | 94694 | 6.9% |
| L | 94694 | 6.9% |
| U | 62102 | 4.5% |
| Y | 52948 | 3.9% |
| E | 44941 | 3.3% |
| V | 43778 | 3.2% |
| Other values (9) | 223183 |
Common
| Value | Count | Frequency (%) |
| 9 | 36412 | |
| 6 | 30512 | |
| 4 | 9503 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1442688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 279226 | |
| O | 182742 | |
| N | 152866 | |
| W | 135087 | |
| D | 94694 | 6.6% |
| L | 94694 | 6.6% |
| U | 62102 | 4.3% |
| Y | 52948 | 3.7% |
| E | 44941 | 3.1% |
| V | 43778 | 3.0% |
| Other values (12) | 299610 |
| Distinct | 357 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| ATL | 38995 |
|---|---|
| ORD | 33262 |
| DFW | 27929 |
| DEN | 23622 |
| CLT | 23266 |
| Other values (352) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2164032 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SFO |
|---|---|
| 2nd row | ORD |
| 3rd row | DEN |
| 4th row | LIH |
| 5th row | MCO |
Common Values
| Value | Count | Frequency (%) |
| ATL | 38995 | 5.4% |
| ORD | 33262 | 4.6% |
| DFW | 27929 | 3.9% |
| DEN | 23622 | 3.3% |
| CLT | 23266 | 3.2% |
| LAX | 22215 | 3.1% |
| SFO | 17722 | 2.5% |
| IAH | 17603 | 2.4% |
| PHX | 17429 | 2.4% |
| LGA | 17049 | 2.4% |
| Other values (347) | 482252 |
Length
| Value | Count | Frequency (%) |
| atl | 38995 | 5.4% |
| ord | 33262 | 4.6% |
| dfw | 27929 | 3.9% |
| den | 23622 | 3.3% |
| clt | 23266 | 3.2% |
| lax | 22215 | 3.1% |
| sfo | 17722 | 2.5% |
| iah | 17603 | 2.4% |
| phx | 17429 | 2.4% |
| lga | 17049 | 2.4% |
| Other values (347) | 482252 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 241438 | 11.2% |
| L | 205975 | 9.5% |
| S | 176144 | 8.1% |
| D | 169572 | 7.8% |
| T | 123311 | 5.7% |
| O | 116296 | 5.4% |
| C | 109935 | 5.1% |
| M | 96136 | 4.4% |
| F | 90890 | 4.2% |
| W | 85445 | 3.9% |
| Other values (16) | 748890 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2164032 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 241438 | 11.2% |
| L | 205975 | 9.5% |
| S | 176144 | 8.1% |
| D | 169572 | 7.8% |
| T | 123311 | 5.7% |
| O | 116296 | 5.4% |
| C | 109935 | 5.1% |
| M | 96136 | 4.4% |
| F | 90890 | 4.2% |
| W | 85445 | 3.9% |
| Other values (16) | 748890 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2164032 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 241438 | 11.2% |
| L | 205975 | 9.5% |
| S | 176144 | 8.1% |
| D | 169572 | 7.8% |
| T | 123311 | 5.7% |
| O | 116296 | 5.4% |
| C | 109935 | 5.1% |
| M | 96136 | 4.4% |
| F | 90890 | 4.2% |
| W | 85445 | 3.9% |
| Other values (16) | 748890 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2164032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 241438 | 11.2% |
| L | 205975 | 9.5% |
| S | 176144 | 8.1% |
| D | 169572 | 7.8% |
| T | 123311 | 5.7% |
| O | 116296 | 5.4% |
| C | 109935 | 5.1% |
| M | 96136 | 4.4% |
| F | 90890 | 4.2% |
| W | 85445 | 3.9% |
| Other values (16) | 748890 |
| Distinct | 357 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| ATL | 38998 |
|---|---|
| ORD | 33277 |
| DFW | 27926 |
| DEN | 23519 |
| CLT | 23211 |
| Other values (352) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2164032 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LAX |
|---|---|
| 2nd row | SGF |
| 3rd row | SUN |
| 4th row | HNL |
| 5th row | ALB |
Common Values
| Value | Count | Frequency (%) |
| ATL | 38998 | 5.4% |
| ORD | 33277 | 4.6% |
| DFW | 27926 | 3.9% |
| DEN | 23519 | 3.3% |
| CLT | 23211 | 3.2% |
| LAX | 21953 | 3.0% |
| PHX | 17562 | 2.4% |
| IAH | 17554 | 2.4% |
| SFO | 17495 | 2.4% |
| LGA | 17067 | 2.4% |
| Other values (347) | 482782 |
Length
| Value | Count | Frequency (%) |
| atl | 38998 | 5.4% |
| ord | 33277 | 4.6% |
| dfw | 27926 | 3.9% |
| den | 23519 | 3.3% |
| clt | 23211 | 3.2% |
| lax | 21953 | 3.0% |
| phx | 17562 | 2.4% |
| iah | 17554 | 2.4% |
| sfo | 17495 | 2.4% |
| lga | 17067 | 2.4% |
| Other values (347) | 482782 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 241375 | 11.2% |
| L | 205560 | 9.5% |
| S | 175611 | 8.1% |
| D | 169840 | 7.8% |
| T | 123427 | 5.7% |
| O | 116520 | 5.4% |
| C | 109799 | 5.1% |
| M | 96067 | 4.4% |
| F | 90573 | 4.2% |
| W | 85276 | 3.9% |
| Other values (16) | 749984 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2164032 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 241375 | 11.2% |
| L | 205560 | 9.5% |
| S | 175611 | 8.1% |
| D | 169840 | 7.8% |
| T | 123427 | 5.7% |
| O | 116520 | 5.4% |
| C | 109799 | 5.1% |
| M | 96067 | 4.4% |
| F | 90573 | 4.2% |
| W | 85276 | 3.9% |
| Other values (16) | 749984 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2164032 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 241375 | 11.2% |
| L | 205560 | 9.5% |
| S | 175611 | 8.1% |
| D | 169840 | 7.8% |
| T | 123427 | 5.7% |
| O | 116520 | 5.4% |
| C | 109799 | 5.1% |
| M | 96067 | 4.4% |
| F | 90573 | 4.2% |
| W | 85276 | 3.9% |
| Other values (16) | 749984 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2164032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 241375 | 11.2% |
| L | 205560 | 9.5% |
| S | 175611 | 8.1% |
| D | 169840 | 7.8% |
| T | 123427 | 5.7% |
| O | 116520 | 5.4% |
| C | 109799 | 5.1% |
| M | 96067 | 4.4% |
| F | 90573 | 4.2% |
| W | 85276 | 3.9% |
| Other values (16) | 749984 |
| Distinct | 1331 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1330.840573 |
| Minimum | 1 |
|---|---|
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 605 |
| Q1 | 915 |
| median | 1323 |
| Q3 | 1735 |
| 95-th percentile | 2130 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 820 |
Descriptive statistics
| Standard deviation | 490.7052696 |
|---|---|
| Coefficient of variation (CV) | 0.368718297 |
| Kurtosis | -1.03596056 |
| Mean | 1330.840573 |
| Median Absolute Deviation (MAD) | 412 |
| Skewness | 0.06418971409 |
| Sum | 959993862 |
| Variance | 240791.6616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 600 | 14320 | 2.0% |
| 700 | 9834 | 1.4% |
| 800 | 5815 | 0.8% |
| 830 | 4496 | 0.6% |
| 630 | 4193 | 0.6% |
| 900 | 4158 | 0.6% |
| 1000 | 4069 | 0.6% |
| 730 | 4049 | 0.6% |
| 1700 | 3843 | 0.5% |
| 1200 | 3792 | 0.5% |
| Other values (1321) | 662775 |
| Value | Count | Frequency (%) |
| 1 | 12 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 20 | < 0.1% |
| 5 | 72 | |
| 6 | 5 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 9 | < 0.1% |
| 10 | 47 |
| Value | Count | Frequency (%) |
| 2359 | 631 | |
| 2358 | 53 | < 0.1% |
| 2357 | 50 | < 0.1% |
| 2356 | 46 | < 0.1% |
| 2355 | 299 | |
| 2354 | 35 | < 0.1% |
| 2353 | 32 | < 0.1% |
| 2352 | 15 | < 0.1% |
| 2351 | 22 | < 0.1% |
| 2350 | 174 | < 0.1% |
dep_delay
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 994 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11772 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.982390793 |
| Minimum | -57 |
|---|---|
| Maximum | 1861 |
| Zeros | 35060 |
| Zeros (%) | 4.9% |
| Negative | 429744 |
| Negative (%) | 59.6% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | -57 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -5 |
| median | -2 |
| Q3 | 7 |
| 95-th percentile | 73 |
| Maximum | 1861 |
| Range | 1918 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 44.81726599 |
|---|---|
| Coefficient of variation (CV) | 4.489632486 |
| Kurtosis | 165.7336724 |
| Mean | 9.982390793 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 9.509406521 |
| Sum | 7083225 |
| Variance | 2008.587331 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5 | 55742 | 7.7% |
| -4 | 54559 | 7.6% |
| -3 | 53272 | 7.4% |
| -2 | 47828 | 6.6% |
| -6 | 44858 | 6.2% |
| -1 | 41606 | 5.8% |
| -7 | 36590 | 5.1% |
| 0 | 35060 | 4.9% |
| -8 | 27533 | 3.8% |
| -9 | 20193 | 2.8% |
| Other values (984) | 292331 |
| Value | Count | Frequency (%) |
| -57 | 1 | < 0.1% |
| -51 | 1 | < 0.1% |
| -49 | 1 | < 0.1% |
| -47 | 1 | < 0.1% |
| -45 | 2 | |
| -43 | 1 | < 0.1% |
| -42 | 1 | < 0.1% |
| -41 | 1 | < 0.1% |
| -40 | 4 | |
| -39 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1861 | 1 | |
| 1576 | 1 | |
| 1559 | 1 | |
| 1531 | 1 | |
| 1528 | 1 | |
| 1522 | 1 | |
| 1518 | 1 | |
| 1486 | 1 | |
| 1460 | 1 | |
| 1417 | 1 |
| Distinct | 169 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11624 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.3887829 |
| Minimum | 1 |
|---|---|
| Maximum | 180 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 11 |
| median | 15 |
| Q3 | 20 |
| 95-th percentile | 35 |
| Maximum | 180 |
| Range | 179 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 9.878590816 |
|---|---|
| Coefficient of variation (CV) | 0.5681013371 |
| Kurtosis | 20.40220955 |
| Mean | 17.3887829 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.236346317 |
| Sum | 12341167 |
| Variance | 97.58655651 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 53729 | 7.4% |
| 13 | 52366 | 7.3% |
| 11 | 52173 | 7.2% |
| 14 | 48393 | 6.7% |
| 10 | 46901 | 6.5% |
| 15 | 44060 | 6.1% |
| 16 | 39070 | 5.4% |
| 9 | 36981 | 5.1% |
| 17 | 34497 | 4.8% |
| 18 | 30384 | 4.2% |
| Other values (159) | 271166 |
| Value | Count | Frequency (%) |
| 1 | 10 | < 0.1% |
| 2 | 20 | < 0.1% |
| 3 | 135 | < 0.1% |
| 4 | 485 | 0.1% |
| 5 | 1852 | 0.3% |
| 6 | 6368 | 0.9% |
| 7 | 14327 | 2.0% |
| 8 | 25268 | |
| 9 | 36981 | |
| 10 | 46901 |
| Value | Count | Frequency (%) |
| 180 | 1 | < 0.1% |
| 177 | 1 | < 0.1% |
| 176 | 1 | < 0.1% |
| 175 | 1 | < 0.1% |
| 173 | 1 | < 0.1% |
| 166 | 1 | < 0.1% |
| 165 | 3 | |
| 163 | 2 | |
| 162 | 2 | |
| 161 | 2 |
wheels_off
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 1431 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11624 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1359.002085 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 616 |
| Q1 | 932 |
| median | 1341 |
| Q3 | 1759 |
| 95-th percentile | 2155 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 827 |
Descriptive statistics
| Standard deviation | 505.8791287 |
|---|---|
| Coefficient of variation (CV) | 0.3722430849 |
| Kurtosis | -0.9238495754 |
| Mean | 1359.002085 |
| Median Absolute Deviation (MAD) | 414 |
| Skewness | -0.005912553281 |
| Sum | 964510960 |
| Variance | 255913.6928 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 610 | 1203 | 0.2% |
| 609 | 1178 | 0.2% |
| 611 | 1172 | 0.2% |
| 608 | 1155 | 0.2% |
| 613 | 1153 | 0.2% |
| 612 | 1144 | 0.2% |
| 607 | 1103 | 0.2% |
| 614 | 1101 | 0.2% |
| 710 | 1030 | 0.1% |
| 709 | 1022 | 0.1% |
| Other values (1421) | 698459 | |
| (Missing) | 11624 | 1.6% |
| Value | Count | Frequency (%) |
| 1 | 129 | |
| 2 | 107 | |
| 3 | 99 | |
| 4 | 107 | |
| 5 | 114 | |
| 6 | 125 | |
| 7 | 103 | |
| 8 | 97 | |
| 9 | 90 | |
| 10 | 121 |
| Value | Count | Frequency (%) |
| 2400 | 85 | |
| 2359 | 132 | |
| 2358 | 103 | |
| 2357 | 107 | |
| 2356 | 114 | |
| 2355 | 110 | |
| 2354 | 101 | |
| 2353 | 115 | |
| 2352 | 127 | |
| 2351 | 102 |
wheels_on
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11950 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1463.093707 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 643 |
| Q1 | 1045 |
| median | 1503 |
| Q3 | 1912 |
| 95-th percentile | 2249 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 867 |
Descriptive statistics
| Standard deviation | 533.6179693 |
|---|---|
| Coefficient of variation (CV) | 0.3647189287 |
| Kurtosis | -0.4533442863 |
| Mean | 1463.093707 |
| Median Absolute Deviation (MAD) | 423 |
| Skewness | -0.3279631239 |
| Sum | 1037909897 |
| Variance | 284748.1372 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2104 | 815 | 0.1% |
| 1844 | 799 | 0.1% |
| 1650 | 797 | 0.1% |
| 1854 | 794 | 0.1% |
| 1850 | 791 | 0.1% |
| 2106 | 791 | 0.1% |
| 1904 | 787 | 0.1% |
| 1644 | 786 | 0.1% |
| 1710 | 785 | 0.1% |
| 1615 | 785 | 0.1% |
| Other values (1430) | 701464 | |
| (Missing) | 11950 | 1.7% |
| Value | Count | Frequency (%) |
| 1 | 398 | |
| 2 | 329 | |
| 3 | 323 | |
| 4 | 299 | |
| 5 | 294 | |
| 6 | 318 | |
| 7 | 305 | |
| 8 | 299 | |
| 9 | 281 | |
| 10 | 261 |
| Value | Count | Frequency (%) |
| 2400 | 249 | |
| 2359 | 355 | |
| 2358 | 349 | |
| 2357 | 390 | |
| 2356 | 368 | |
| 2355 | 379 | |
| 2354 | 349 | |
| 2353 | 366 | |
| 2352 | 390 | |
| 2351 | 400 |
| Distinct | 150 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11950 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.601802383 |
| Minimum | 1 |
|---|---|
| Maximum | 258 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 18 |
| Maximum | 258 |
| Range | 257 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 6.071801211 |
|---|---|
| Coefficient of variation (CV) | 0.7987317882 |
| Kurtosis | 51.51948995 |
| Mean | 7.601802383 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.620734385 |
| Sum | 5392673 |
| Variance | 36.86676995 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 107022 | |
| 5 | 100850 | |
| 6 | 82150 | |
| 3 | 80221 | |
| 7 | 65603 | |
| 8 | 48874 | |
| 9 | 37974 | 5.3% |
| 10 | 29575 | 4.1% |
| 2 | 26371 | 3.7% |
| 11 | 23274 | 3.2% |
| Other values (140) | 107480 |
| Value | Count | Frequency (%) |
| 1 | 1771 | 0.2% |
| 2 | 26371 | 3.7% |
| 3 | 80221 | |
| 4 | 107022 | |
| 5 | 100850 | |
| 6 | 82150 | |
| 7 | 65603 | |
| 8 | 48874 | |
| 9 | 37974 | 5.3% |
| 10 | 29575 | 4.1% |
| Value | Count | Frequency (%) |
| 258 | 2 | |
| 182 | 1 | |
| 180 | 1 | |
| 177 | 1 | |
| 176 | 1 | |
| 173 | 1 | |
| 161 | 1 | |
| 159 | 1 | |
| 158 | 1 | |
| 153 | 1 |
| Distinct | 1405 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1487.061976 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 714 |
| Q1 | 1101 |
| median | 1516 |
| Q3 | 1919 |
| 95-th percentile | 2255 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 818 |
Descriptive statistics
| Standard deviation | 518.5120298 |
|---|---|
| Coefficient of variation (CV) | 0.3486821923 |
| Kurtosis | -0.4636437834 |
| Mean | 1487.061976 |
| Median Absolute Deviation (MAD) | 409 |
| Skewness | -0.3024625396 |
| Sum | 1072683234 |
| Variance | 268854.725 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2100 | 2115 | 0.3% |
| 1700 | 2044 | 0.3% |
| 1900 | 2002 | 0.3% |
| 1130 | 1996 | 0.3% |
| 1855 | 1981 | 0.3% |
| 2115 | 1952 | 0.3% |
| 2125 | 1909 | 0.3% |
| 2120 | 1870 | 0.3% |
| 1925 | 1852 | 0.3% |
| 900 | 1839 | 0.3% |
| Other values (1395) | 701784 |
| Value | Count | Frequency (%) |
| 1 | 247 | < 0.1% |
| 2 | 187 | < 0.1% |
| 3 | 220 | < 0.1% |
| 4 | 213 | < 0.1% |
| 5 | 746 | |
| 6 | 168 | < 0.1% |
| 7 | 178 | < 0.1% |
| 8 | 154 | < 0.1% |
| 9 | 247 | < 0.1% |
| 10 | 643 |
| Value | Count | Frequency (%) |
| 2400 | 10 | < 0.1% |
| 2359 | 1224 | |
| 2358 | 547 | 0.1% |
| 2357 | 587 | |
| 2356 | 581 | |
| 2355 | 1394 | |
| 2354 | 529 | 0.1% |
| 2353 | 403 | 0.1% |
| 2352 | 333 | < 0.1% |
| 2351 | 383 | 0.1% |
arr_delay
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 1015 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13704 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.045278673 |
| Minimum | -102 |
|---|---|
| Maximum | 1861 |
| Zeros | 13971 |
| Zeros (%) | 1.9% |
| Negative | 442362 |
| Negative (%) | 61.3% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | -102 |
|---|---|
| 5-th percentile | -26 |
| Q1 | -14 |
| median | -6 |
| Q3 | 8 |
| 95-th percentile | 73 |
| Maximum | 1861 |
| Range | 1963 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 46.9310833 |
|---|---|
| Coefficient of variation (CV) | 9.301980394 |
| Kurtosis | 138.8686884 |
| Mean | 5.045278673 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 8.388702025 |
| Sum | 3570241 |
| Variance | 2202.52658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11 | 21072 | 2.9% |
| -10 | 20992 | 2.9% |
| -9 | 20958 | 2.9% |
| -8 | 20805 | 2.9% |
| -12 | 20680 | 2.9% |
| -7 | 20293 | 2.8% |
| -13 | 20221 | 2.8% |
| -6 | 19527 | 2.7% |
| -14 | 19199 | 2.7% |
| -5 | 18677 | 2.6% |
| Other values (1005) | 505216 |
| Value | Count | Frequency (%) |
| -102 | 1 | < 0.1% |
| -91 | 1 | < 0.1% |
| -83 | 1 | < 0.1% |
| -81 | 1 | < 0.1% |
| -77 | 1 | < 0.1% |
| -74 | 4 | |
| -73 | 2 | |
| -72 | 2 | |
| -71 | 4 | |
| -70 | 3 |
| Value | Count | Frequency (%) |
| 1861 | 1 | |
| 1576 | 1 | |
| 1558 | 1 | |
| 1543 | 1 | |
| 1523 | 1 | |
| 1515 | 1 | |
| 1506 | 1 | |
| 1477 | 1 | |
| 1449 | 1 | |
| 1430 | 1 |
cancelled
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| 0.0 | |
|---|---|
| 1.0 | 11700 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2164032 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 709644 | |
| 1.0 | 11700 | 1.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 709644 | |
| 1.0 | 11700 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1430988 | |
| . | 721344 | |
| 1 | 11700 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1442688 | |
| Other Punctuation | 721344 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1430988 | |
| 1 | 11700 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 721344 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2164032 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1430988 | |
| . | 721344 | |
| 1 | 11700 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2164032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1430988 | |
| . | 721344 | |
| 1 | 11700 | 0.5% |
diverted
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| 0.0 | |
|---|---|
| 1.0 | 1747 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2164032 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 719597 | |
| 1.0 | 1747 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 719597 | |
| 1.0 | 1747 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1440941 | |
| . | 721344 | |
| 1 | 1747 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1442688 | |
| Other Punctuation | 721344 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1440941 | |
| 1 | 1747 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 721344 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2164032 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1440941 | |
| . | 721344 | |
| 1 | 1747 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2164032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1440941 | |
| . | 721344 | |
| 1 | 1747 | 0.1% |
crs_elapsed_time
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 546 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 141.1707246 |
| Minimum | 21 |
|---|---|
| Maximum | 703 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 88 |
| median | 122 |
| Q3 | 171 |
| 95-th percentile | 304 |
| Maximum | 703 |
| Range | 682 |
| Interquartile range (IQR) | 83 |
Descriptive statistics
| Standard deviation | 73.39593725 |
|---|---|
| Coefficient of variation (CV) | 0.5199090495 |
| Kurtosis | 2.308242928 |
| Mean | 141.1707246 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 1.431518039 |
| Sum | 101832514 |
| Variance | 5386.963604 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 13152 | 1.8% |
| 85 | 13078 | 1.8% |
| 80 | 12447 | 1.7% |
| 70 | 11571 | 1.6% |
| 75 | 10669 | 1.5% |
| 65 | 9956 | 1.4% |
| 95 | 9716 | 1.3% |
| 110 | 9325 | 1.3% |
| 100 | 9165 | 1.3% |
| 105 | 8807 | 1.2% |
| Other values (536) | 613457 |
| Value | Count | Frequency (%) |
| 21 | 22 | |
| 22 | 14 | |
| 23 | 19 | |
| 24 | 6 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 4 | < 0.1% |
| 27 | 5 | < 0.1% |
| 30 | 9 | < 0.1% |
| 31 | 30 | |
| 32 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 703 | 1 | < 0.1% |
| 695 | 6 | |
| 690 | 5 | |
| 683 | 4 | |
| 681 | 8 | |
| 679 | 3 | < 0.1% |
| 675 | 1 | < 0.1% |
| 672 | 5 | |
| 670 | 3 | < 0.1% |
| 658 | 5 |
actual_elapsed_time
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 637 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13447 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136.4907592 |
| Minimum | 16 |
|---|---|
| Maximum | 739 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 83 |
| median | 118 |
| Q3 | 167 |
| 95-th percentile | 297 |
| Maximum | 739 |
| Range | 723 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 73.15503209 |
|---|---|
| Coefficient of variation (CV) | 0.5359705851 |
| Kurtosis | 2.288041661 |
| Mean | 136.4907592 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 1.417156948 |
| Sum | 96621399 |
| Variance | 5351.658721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 79 | 6088 | 0.8% |
| 77 | 5955 | 0.8% |
| 80 | 5953 | 0.8% |
| 82 | 5908 | 0.8% |
| 81 | 5890 | 0.8% |
| 85 | 5881 | 0.8% |
| 76 | 5859 | 0.8% |
| 78 | 5822 | 0.8% |
| 75 | 5783 | 0.8% |
| 83 | 5706 | 0.8% |
| Other values (627) | 649052 | |
| (Missing) | 13447 | 1.9% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 18 | 5 | |
| 19 | 5 | |
| 20 | 4 | < 0.1% |
| 21 | 4 | < 0.1% |
| 22 | 4 | < 0.1% |
| 23 | 10 | |
| 24 | 8 | |
| 25 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 739 | 1 | < 0.1% |
| 716 | 1 | < 0.1% |
| 715 | 1 | < 0.1% |
| 714 | 1 | < 0.1% |
| 701 | 1 | < 0.1% |
| 687 | 3 | |
| 686 | 2 | |
| 679 | 1 | < 0.1% |
| 676 | 1 | < 0.1% |
| 674 | 3 |
air_time
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 614 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13447 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 111.5125873 |
| Minimum | 7 |
|---|---|
| Maximum | 688 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 60 |
| median | 92 |
| Q3 | 141 |
| 95-th percentile | 270 |
| Maximum | 688 |
| Range | 681 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Standard deviation | 71.13892974 |
|---|---|
| Coefficient of variation (CV) | 0.6379452892 |
| Kurtosis | 2.323422908 |
| Mean | 111.5125873 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 1.441227262 |
| Sum | 78939426 |
| Variance | 5060.747325 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 6608 | 0.9% |
| 63 | 6565 | 0.9% |
| 61 | 6530 | 0.9% |
| 60 | 6481 | 0.9% |
| 58 | 6433 | 0.9% |
| 64 | 6379 | 0.9% |
| 65 | 6360 | 0.9% |
| 59 | 6356 | 0.9% |
| 45 | 6231 | 0.9% |
| 57 | 6205 | 0.9% |
| Other values (604) | 643749 | |
| (Missing) | 13447 | 1.9% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 8 | 6 | < 0.1% |
| 9 | 14 | < 0.1% |
| 10 | 11 | < 0.1% |
| 11 | 10 | < 0.1% |
| 12 | 8 | < 0.1% |
| 13 | 14 | < 0.1% |
| 14 | 40 | < 0.1% |
| 15 | 105 | |
| 16 | 164 |
| Value | Count | Frequency (%) |
| 688 | 1 | |
| 687 | 1 | |
| 678 | 1 | |
| 675 | 1 | |
| 666 | 1 | |
| 665 | 1 | |
| 659 | 1 | |
| 657 | 2 | |
| 654 | 1 | |
| 650 | 1 |
| Distinct | 1536 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 800.3805147 |
| Minimum | 31 |
|---|---|
| Maximum | 4983 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 31 |
|---|---|
| 5-th percentile | 164 |
| Q1 | 363 |
| median | 632 |
| Q3 | 1034 |
| 95-th percentile | 2176 |
| Maximum | 4983 |
| Range | 4952 |
| Interquartile range (IQR) | 671 |
Descriptive statistics
| Standard deviation | 598.6037086 |
|---|---|
| Coefficient of variation (CV) | 0.7478989026 |
| Kurtosis | 2.44681817 |
| Mean | 800.3805147 |
| Median Absolute Deviation (MAD) | 315 |
| Skewness | 1.477416356 |
| Sum | 577349682 |
| Variance | 358326.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 337 | 5057 | 0.7% |
| 733 | 3597 | 0.5% |
| 296 | 3486 | 0.5% |
| 594 | 3032 | 0.4% |
| 214 | 2968 | 0.4% |
| 399 | 2946 | 0.4% |
| 447 | 2899 | 0.4% |
| 404 | 2874 | 0.4% |
| 867 | 2791 | 0.4% |
| 588 | 2717 | 0.4% |
| Other values (1526) | 688977 |
| Value | Count | Frequency (%) |
| 31 | 64 | < 0.1% |
| 41 | 15 | < 0.1% |
| 55 | 8 | < 0.1% |
| 66 | 178 | < 0.1% |
| 67 | 416 | |
| 68 | 135 | < 0.1% |
| 69 | 157 | < 0.1% |
| 70 | 15 | < 0.1% |
| 73 | 522 | |
| 74 | 464 |
| Value | Count | Frequency (%) |
| 4983 | 75 | |
| 4962 | 79 | |
| 4817 | 34 | |
| 4502 | 77 | |
| 4243 | 77 | |
| 4184 | 39 | |
| 3972 | 45 | |
| 3904 | 66 | |
| 3847 | 4 | < 0.1% |
| 3801 | 82 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 721344 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 721344 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 721344 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 721344 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 470037 | |
| 1 | 251307 |
day
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| Monday | |
|---|---|
| Friday | |
| Thursday | |
| Wednesday | |
| Tuesday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.119804143 |
| Min length | 6 |
Characters and Unicode
| Total characters | 5135828 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sunday |
|---|---|
| 2nd row | Sunday |
| 3rd row | Friday |
| 4th row | Sunday |
| 5th row | Tuesday |
Common Values
| Value | Count | Frequency (%) |
| Monday | 108362 | |
| Friday | 107919 | |
| Thursday | 107062 | |
| Wednesday | 104693 | |
| Tuesday | 103191 | |
| Sunday | 101932 | |
| Saturday | 88185 |
Length
Pie chart
| Value | Count | Frequency (%) |
| monday | 108362 | |
| friday | 107919 | |
| thursday | 107062 | |
| wednesday | 104693 | |
| tuesday | 103191 | |
| sunday | 101932 | |
| saturday | 88185 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 826037 | |
| a | 809529 | |
| y | 721344 | |
| u | 400370 | |
| n | 314987 | 6.1% |
| s | 314946 | 6.1% |
| e | 312577 | 6.1% |
| r | 303166 | 5.9% |
| T | 210253 | 4.1% |
| S | 190117 | 3.7% |
| Other values (7) | 732502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4414484 | |
| Uppercase Letter | 721344 | 14.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 826037 | |
| a | 809529 | |
| y | 721344 | |
| u | 400370 | |
| n | 314987 | 7.1% |
| s | 314946 | 7.1% |
| e | 312577 | 7.1% |
| r | 303166 | 6.9% |
| o | 108362 | 2.5% |
| i | 107919 | 2.4% |
| Other values (2) | 195247 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 210253 | |
| S | 190117 | |
| M | 108362 | |
| F | 107919 | |
| W | 104693 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5135828 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 826037 | |
| a | 809529 | |
| y | 721344 | |
| u | 400370 | |
| n | 314987 | 6.1% |
| s | 314946 | 6.1% |
| e | 312577 | 6.1% |
| r | 303166 | 5.9% |
| T | 210253 | 4.1% |
| S | 190117 | 3.7% |
| Other values (7) | 732502 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5135828 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 826037 | |
| a | 809529 | |
| y | 721344 | |
| u | 400370 | |
| n | 314987 | 6.1% |
| s | 314946 | 6.1% |
| e | 312577 | 6.1% |
| r | 303166 | 5.9% |
| T | 210253 | 4.1% |
| S | 190117 | 3.7% |
| Other values (7) | 732502 |
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.7341823 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 27 |
| Q3 | 39 |
| 95-th percentile | 50 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.8156822 |
|---|---|
| Coefficient of variation (CV) | 0.5541849767 |
| Kurtosis | -1.163709087 |
| Mean | 26.7341823 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.00966330638 |
| Sum | 19284542 |
| Variance | 219.504439 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 14969 | 2.1% |
| 28 | 14916 | 2.1% |
| 31 | 14875 | 2.1% |
| 26 | 14829 | 2.1% |
| 29 | 14810 | 2.1% |
| 24 | 14787 | 2.0% |
| 30 | 14736 | 2.0% |
| 32 | 14716 | 2.0% |
| 33 | 14585 | 2.0% |
| 23 | 14357 | 2.0% |
| Other values (43) | 573764 |
| Value | Count | Frequency (%) |
| 1 | 13101 | |
| 2 | 12790 | |
| 3 | 12547 | |
| 4 | 12693 | |
| 5 | 12557 | |
| 6 | 12997 | |
| 7 | 12960 | |
| 8 | 13452 | |
| 9 | 13350 | |
| 10 | 13877 |
| Value | Count | Frequency (%) |
| 53 | 1678 | 0.2% |
| 52 | 13213 | |
| 51 | 14120 | |
| 50 | 13096 | |
| 49 | 13304 | |
| 48 | 13816 | |
| 47 | 12990 | |
| 46 | 13965 | |
| 45 | 13711 | |
| 44 | 13325 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | fl_date | op_carrier | origin | dest | crs_dep_time | dep_delay | taxi_out | wheels_off | wheels_on | taxi_in | crs_arr_time | arr_delay | cancelled | diverted | crs_elapsed_time | actual_elapsed_time | air_time | distance | delayed | day | week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2832825 | 2018-05-27 | AS | SFO | LAX | 2105 | -3.0 | 16.0 | 2118.0 | 2211.0 | 10.0 | 2232 | -11.0 | 0.0 | 0.0 | 87.0 | 79.0 | 53.0 | 337.0 | 0 | Sunday | 21 |
| 1 | 241995 | 2018-01-14 | EV | ORD | SGF | 1555 | -2.0 | 15.0 | 1608.0 | 1718.0 | 8.0 | 1744 | -18.0 | 0.0 | 0.0 | 109.0 | 93.0 | 70.0 | 438.0 | 0 | Sunday | 02 |
| 2 | 339699 | 2018-01-19 | OO | DEN | SUN | 1130 | -4.0 | 14.0 | 1140.0 | NaN | NaN | 1334 | NaN | 0.0 | 1.0 | 124.0 | NaN | NaN | 557.0 | 0 | Friday | 03 |
| 3 | 5542533 | 2018-10-07 | HA | LIH | HNL | 1042 | -7.0 | 8.0 | 1043.0 | 1103.0 | 7.0 | 1119 | -9.0 | 0.0 | 0.0 | 37.0 | 35.0 | 20.0 | 102.0 | 0 | Sunday | 40 |
| 4 | 27344 | 2018-01-02 | WN | MCO | ALB | 1335 | 2.0 | 17.0 | 1354.0 | 1613.0 | 2.0 | 1615 | 0.0 | 0.0 | 0.0 | 160.0 | 158.0 | 139.0 | 1073.0 | 0 | Tuesday | 01 |
| 5 | 1210004 | 2018-03-07 | OO | MLI | MSP | 645 | -8.0 | 33.0 | 710.0 | 809.0 | 9.0 | 811 | 7.0 | 0.0 | 0.0 | 86.0 | 101.0 | 59.0 | 274.0 | 1 | Wednesday | 10 |
| 6 | 4985895 | 2018-09-09 | DL | JAC | SLC | 700 | 1.0 | 14.0 | 715.0 | 753.0 | 5.0 | 805 | -7.0 | 0.0 | 0.0 | 65.0 | 57.0 | 38.0 | 205.0 | 0 | Sunday | 36 |
| 7 | 3864175 | 2018-07-16 | DL | STT | ATL | 1442 | 2.0 | 11.0 | 1455.0 | 1809.0 | 10.0 | 1843 | -24.0 | 0.0 | 0.0 | 241.0 | 215.0 | 194.0 | 1599.0 | 0 | Monday | 29 |
| 8 | 7439 | 2018-01-01 | OO | TUS | LAX | 910 | -10.0 | 17.0 | 917.0 | 932.0 | 7.0 | 1004 | -25.0 | 0.0 | 0.0 | 114.0 | 99.0 | 75.0 | 451.0 | 0 | Monday | 01 |
| 9 | 799119 | 2018-02-13 | WN | PHX | OAK | 1300 | 10.0 | 9.0 | 1319.0 | 1356.0 | 4.0 | 1405 | -5.0 | 0.0 | 0.0 | 125.0 | 110.0 | 97.0 | 646.0 | 0 | Tuesday | 07 |
Last rows
| df_index | fl_date | op_carrier | origin | dest | crs_dep_time | dep_delay | taxi_out | wheels_off | wheels_on | taxi_in | crs_arr_time | arr_delay | cancelled | diverted | crs_elapsed_time | actual_elapsed_time | air_time | distance | delayed | day | week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 721334 | 6065437 | 2018-11-02 | OO | SBP | LAX | 1657 | -9.0 | 20.0 | 1708.0 | 1745.0 | 14.0 | 1810 | -11.0 | 0.0 | 0.0 | 73.0 | 71.0 | 37.0 | 156.0 | 0 | Friday | 44 |
| 721335 | 1353960 | 2018-03-14 | WN | MDW | OAK | 2015 | -2.0 | 10.0 | 2023.0 | 2229.0 | 6.0 | 2255 | -20.0 | 0.0 | 0.0 | 280.0 | 262.0 | 246.0 | 1844.0 | 0 | Wednesday | 11 |
| 721336 | 3709250 | 2018-07-09 | WN | PHX | CMH | 1920 | 152.0 | 6.0 | 2158.0 | 433.0 | 6.0 | 155 | 164.0 | 0.0 | 0.0 | 215.0 | 227.0 | 215.0 | 1670.0 | 1 | Monday | 28 |
| 721337 | 2848071 | 2018-05-28 | AA | BOS | LGA | 1600 | -5.0 | 16.0 | 1611.0 | 1653.0 | 11.0 | 1724 | -20.0 | 0.0 | 0.0 | 84.0 | 69.0 | 42.0 | 184.0 | 0 | Monday | 22 |
| 721338 | 1465806 | 2018-03-20 | B6 | FLL | SYR | 2005 | 23.0 | 53.0 | 2121.0 | 2353.0 | 2.0 | 2307 | 48.0 | 0.0 | 0.0 | 182.0 | 207.0 | 152.0 | 1197.0 | 1 | Tuesday | 12 |
| 721339 | 4237444 | 2018-08-03 | AA | CLT | MCO | 2210 | NaN | NaN | NaN | NaN | NaN | 2345 | NaN | 1.0 | 0.0 | 95.0 | NaN | NaN | 468.0 | 0 | Friday | 31 |
| 721340 | 660476 | 2018-02-06 | EV | DFW | LBB | 845 | -5.0 | 9.0 | 849.0 | 937.0 | 4.0 | 958 | -17.0 | 0.0 | 0.0 | 73.0 | 61.0 | 48.0 | 282.0 | 0 | Tuesday | 06 |
| 721341 | 705694 | 2018-02-08 | WN | SMF | LAX | 1405 | -7.0 | 14.0 | 1412.0 | 1511.0 | 7.0 | 1530 | -12.0 | 0.0 | 0.0 | 85.0 | 80.0 | 59.0 | 373.0 | 0 | Thursday | 06 |
| 721342 | 1950105 | 2018-04-13 | WN | EWR | OAK | 1710 | 31.0 | 30.0 | 1811.0 | 2057.0 | 6.0 | 2025 | 38.0 | 0.0 | 0.0 | 375.0 | 382.0 | 346.0 | 2555.0 | 1 | Friday | 15 |
| 721343 | 6199338 | 2018-11-09 | DL | PHX | MSP | 915 | -4.0 | 12.0 | 923.0 | 1305.0 | 4.0 | 1332 | -23.0 | 0.0 | 0.0 | 197.0 | 178.0 | 162.0 | 1276.0 | 0 | Friday | 45 |